Hypergraph-Clustering Method Based on an Improved Apriori Algorithm

Abstract

With the growing complexity and variability of data structures and dimensions, traditional clustering algorithms face various challenges. The integration of network science and clustering has become a popular field of exploration. One of the main challenges is how to handle large-scale, complex, high-dimensional data effectively. Hypergraphs can accurately represent multidimensional heterogeneous data, making them important for improving clustering performance. In this paper, we propose a hypergraph-clustering method dubbed the “high-dimensional method” based on hypergraph partitioning using an improved Apriori algorithm (HDHPA). First, the method constructs a hypergraph using the improved Apriori association rule algorithm, where frequent itemsets existing in the data are treated as hyperedges. Then, different frequent itemsets are mined in parallel to obtain hyperedges with corresponding ranks, avoiding the generation of redundant rules and improving mining efficiency. Next, we use the dense subgraph partition (DSP) algorithm to divide the hypergraph into multiple subclusters. Finally, we merge the subclusters through sub-hypergraphs to obtain the clustering results. The advantage of the method lies in its use of the hypergraph model to discretize the associations between data in space, which further enhances the effectiveness and accuracy of clustering. We comprehensively compare the proposed HDHPA method with several advanced hypergraph-clustering methods on seven types of datasets and then compare their running times. The results show that the evaluation index values of HDHPA are generally superior to those of all other methods. The maximum ARI value can reach 0.834, an increase of 42%, and the average running time is lower than that of the other methods. All in all, HDHPA exhibits excellent and comparable performance on real networks. The research results of this paper provide an effective solution for processing and analyzing large-scale network datasets and are also conducive to broadening the application range of clustering techniques.
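The first step described in the abstract, treating frequent itemsets as hyperedges, can be sketched as follows. This is a minimal illustration using the classic, unmodified Apriori procedure (level-wise join and prune), not the paper's improved or parallel variant, and it does not cover the DSP partitioning or merging steps. The function names `frequent_itemsets` and `build_hyperedges`, the `min_support` parameter, and the choice of support fraction as hyperedge weight are assumptions made for this example.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Classic Apriori: return {frozenset(items): count} for every itemset
    that appears in at least `min_support` transactions."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count single items.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(current)

    k = 2
    while current:
        prev = set(current)
        # Join step: combine frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in prev for s in combinations(c, k - 1))
        }
        # Count support of the surviving candidates.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        current = {s: c for s, c in counts.items() if c >= min_support}
        result.update(current)
        k += 1
    return result


def build_hyperedges(transactions, min_support):
    """Assumed convention: items are vertices, and each frequent itemset with
    two or more items becomes a hyperedge weighted by its support fraction."""
    n = len(transactions)
    itemsets = frequent_itemsets(transactions, min_support)
    return {s: c / n for s, c in itemsets.items() if len(s) >= 2}


# Toy data, invented for illustration only.
transactions = [
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "c"},
    {"b", "c"},
    {"a", "b", "c"},
]
hyperedges = build_hyperedges(transactions, min_support=3)
for edge, weight in sorted(hyperedges.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(edge), weight)
```

In this toy example each pair of items co-occurs in three of the five transactions, so the three pairs become hyperedges, while the triple occurs only twice and is dropped at `min_support=3`. A downstream partitioning step would then operate on the resulting hyperedge set.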

Related resources

An improved opposition-based Crow Search Algorithm for Data Clustering

Data clustering is an ideal way of working with a huge amount of data and looking for structure in the dataset. In other words, clustering groups similar data: the similarity among the data within a cluster is maximal and the similarity among the data in different clusters is minimal. The innovation of this paper is a clustering method based on the Crow Search Algorithm (CS...

A Hybrid Time Series Clustering Method Based on Fuzzy C-Means Algorithm: An Agreement Based Clustering Approach

In recent years, the advancement of information-gathering technologies such as GPS and GSM networks has led to huge, complex datasets such as time series and trajectories. As a result, it is essential to use appropriate methods to analyze the large raw datasets produced. Extracting useful information from large datasets has always been one of the most important challenges in different sciences,...

An Improved Apriori Algorithm for Association Rules

There are several algorithms for mining association rules. One of the most popular is Apriori, which is used to extract frequent itemsets from large databases and to derive association rules for knowledge discovery. Based on this algorithm, this paper identifies the limitation of the original Apriori algorithm of wasting time for scanning the whole database searching on the frequen...

Medical Diagnosis Data Mining Based on Improved Apriori Algorithm

With the wide application of computer science and technology, the amount of data generated by various disciplines increased rapidly. In order to discover valuable knowledge in these databases, people use data mining methods to solve this problem. The application of association rule mining is an important research topic in data mining. As the association rule technology becomes more mature, it i...

Enterprise Human Resources Information Mining Based on Improved Apriori Algorithm

With the continuing development of information and technology in modern society, enterprises’ demand for human resources information mining keeps growing. Based on the current state of enterprise human resources information mining, this paper puts forward an improved Apriori algorithm-based model for enterprise human resources information mining; this model introduced da...

Journal

Journal title: Applied Sciences

Year: 2023

ISSN: 2076-3417

DOI: https://doi.org/10.3390/app131910577